Statistical Analysis of Genetic Associations

نویسنده

  • Dmitri V. Zaykin
چکیده

ZAYKIN, DMITRI V. Statistical Analysis of Genetic Associations (Advisor: Bruce S. Weir) There is an increasing need for a statistical treatment of genetic data prompted by recent advances in molecular genetics and molecular technology. Study of associations between genes is one of the most important aspects in applications of population genetics theory and statistical methodology to genetic data. Developments of these methods are important for conservation biology, experimental population genetics, forensic science, and for mapping human disease genes. Over the next several years, genotypic data will be collected to attempt locating positions of multiple genes affecting disease phenotype. Adequate statistical methodology is required to analyze these data. Special attention should be paid to multiple testing issues resulting from searching through many genetic markers and high risk of false associations. In this research we develop theory and methods needed to treat some of these problems. We introduce exact conditional tests for analyzing associations within and between genes in samples of multilocus genotypes and efficient algorithms to perform them. These tests are formulated for the general case of multiple alleles at arbitrary numbers of loci and lead to multiple testing adjustments based on the closing testing principle, thus providing strong protection of the family-wise error rate. We discuss an application of the closing method to the testing for Hardy-Weinberg equilibrium and computationally efficient shortcuts arising from methods for combining p-values that allow to deal with large numbers of loci. We also discuss efficient Bayesian tests for heterozygote excess and deficiency, as a special case of testing for Hardy-Weinberg equilibrium, and the frequentist properties of a p-value type of quantity resulting from them. We further develop new methods for validation of experiments and for combining and adjusting independent and correlated p-values and apply them to simulated as well as to actual gene expression data sets. These methods prove to be especially useful in situations with large numbers of statistical tests, such as in whole-genome screens for associations of genetic markers with disease phenotypes and in analyzing gene expression data obtained from DNA microarrays. STATISTICAL ANALYSIS OF GENETIC ASSOCIATIONS

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Gene-Gene Interaction Study Between Genetic Polymorphisms of Folate Metabolism and MTR SNPs on Prognostic Features Impact for Breast Cancer

Background: Breast Cancer (BC), the second leading cause of cancer mortality after lung cancer and varied across the world due to genetic and environmental factors. In this study, we evaluated the interaction between the polymorphisms in genes encoding enzymes of folate metabolism: methylenetetrahydrofolate reductase (MTHFR), methionine synthesis reductase (MTR) with the BC prognostic factors. ...

متن کامل

Unveiling the genetic loci for a panicle developmental trait using genome-wide association study in rice

Panicle size has a high correlation with grain yield in rice. There is a bottleneck to identify the additional quantitative trait loci (QTL) for panicle size due to the conventional traits used for QTL mapping. To identify more genetic loci for panicle size, a panicle developmental trait (LNTB, the length from panicle neck-knot to the first primary branch in the rachis) related to panicle size ...

متن کامل

Association of Obesity Related Genetic Variants (FTO and MC4R) with Breast Cancer Risk:a population-based case-control study in Iran

Background: The heterogeneous breast cancer is the most common cause of cancer-related mortality. Obesity defined by BMI is known as a major risk factor for breast cancer. Objective: The purpose of this study was to explore the role of obesity related-polymorphisms rs9939609 FTO and rs17782313 MC4R in breast cancer development. Materials and Methods: We obtained matched peripheral blood, serum ...

متن کامل

Investigating the rs2237892 and rs231362 Polymorphisms of KCNQ1 Gene Associations with Type 2 Diabetes in an Iranian Population (Yazd Province)

Objective: Type 2 diabetes (T2DM) is a worldwide prevalent metabolic disorder and the cause of many morbidities and mortalities. KCNQ1 gene encodes α-subunit of voltage-gated potassium (K+) channel which plays a role in insulin secretion in the pancreas, thus its variants may confer susceptibility to diabetes. Recognition of genetic variants involved in T2DM could help the early diagnosis and p...

متن کامل

Optimal Feature Extraction for Discriminating Raman Spectra of Different Skin Samples using Statistical Methods and Genetic Algorithm

Introduction: Raman spectroscopy, that is a spectroscopic technique based on inelastic scattering of monochromatic light, can provide valuable information about molecular vibrations, so using this technique we can study molecular changes in a sample. Material and Methods: In this research, 153 Raman spectra obtained from normal and dried skin samples. Baseline and electrical noise were eliminat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999